Efficient Summarization of Stereoscopic Video Sequences

نویسندگان

  • Nikolaos D. Doulamis
  • Anastasios D. Doulamis
  • Yannis S. Avrithis
  • Klimis S. Ntalianis
  • Stefanos D. Kollias
چکیده

An efficient technique for summarization of stereoscopic video sequences is presented in this paper, which extracts a small but meaningful set of video frames using a content-based sampling algorithm. The proposed video-content representation provides the capability of browsing digital stereoscopic video sequences and performing more efficient content-based queries and indexing. Each stereoscopic video sequence is first partitioned into shots by applying a shot-cut detection algorithm so that frames (or stereo pairs) of similar visual characteristics are gathered together. Each shot is then analyzed using stereo-imaging techniques, and the disparity field, occluded areas, and depth map are estimated. A multiresolution implementation of the Recursive Shortest Spanning Tree (RSST) algorithm is applied for color and depth segmentation, while fusion of color and depth segments is employed for reliable video object extraction. In particular, color segments are projected onto depth segments so that video objects on the same depth plane are retained, while at the same time accurate object boundaries are extracted. Feature vectors are then constructed using multidimensional fuzzy classification of segment features including size, location, color, and depth. Shot selection is accomplished by clustering similar shots based on the generalized Lloyd–Max algorithm, while for a given shot, key frames are extracted using an optimization method for locating frames of minimally correlated feature vectors. For efficient implementation of the latter method, a genetic algorithm is used. Experimental results are presented, which indicate the reliable performance of the proposed scheme on real-life stereoscopic video sequences.

منابع مشابه

An Optimal Framework for Summarization of Stereoscopic Video Sequences

In this paper an optimal framework for summarization of stereoscopic video sequences is presented, which extracts a meaningful set of video frames. Each sequence is first partitioned into shots, the disparity field, occluded areas and depth map are estimated and then a hierarchical color and depth segmentation scheme is applied to each shot, based on a multiresolution implementation of the RSST...

متن کامل

Unsupervised Semantic Object Segmentation of Stereoscopic Video Sequences

In this paper, we present an efficient technique for unsupervised semantically meaningful object segmentation of stereoscopic video sequences. By this technique we achieve to extract semantic objects using the additional information a stereoscopic pair of frames provides. Each pair is analyzed and the disparity field, occluded areas and depth map are estimated. The key algorithm, which is appli...

متن کامل

Segmentation of Sequences of Stereoscopic Images for Modelling Artificial Muscles

In this paper, an implementation of the Region Competition algorithm for segmenting stereoscopic video sequences is shown. This algorithm is an essential task in the method in order to obtain a 3D characterization of artificial muscles. Image sequences are acquired by a two-cam computer vision system. Optimal and efficient segmentation of these images is our goal; information obtained from the ...

متن کامل

Optimized Region Competition Algorithm Applied to the Segmentation of Artificial Muscles in Stereoscopic Images

This paper addresses the implementation of the Region Competition algorithm for segmenting stereoscopic video sequences. The segmentation performed by this algorithm is an essential stage for the 3D characterization of artificial muscles. Image sequences are acquired by a two-cam computer vision system. The aim of this work is to come up with optimal and efficient segmentation of these images; ...

متن کامل

Compatible Video Coding of Stereoscopic Sequences using MPEG-2's Scalability and Interlaced Structure

Three approaches of an MPEG-2 compatible coding technique are presented for stereoscopic sequences. The rst method utilizes the spatial scalability structure and the second employs the temporal scalability syntax. The scalability extensions of the video coding standard make the processing easier to accommodate the transmission of a stereoscopic video stream. The left and right channels required...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000